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ONR  Grant  N00014-94-1-0794  \ 

Principal  Znvaatigator:  N.  F.  Janowitz 

The  conference  was  held  October  14-17,  1993,  There  were  a 
total  of  59  participeuits  from  10  different  countries.  The  focus 
of  the  conference  centered  on  the  following  topics : 

1,  What  is  the  nature  of  ordinal  data? 

2,  How  does  ordinal  data  arise  euid  what  cure  its  uses? 

3 ,  What  techniques  have  been  used  in  the  past  to  help 
interpret  various  types  of  ordinal  data?  ■ 

4 ,  What  directions  should  be  taken  by  future  research? 

The  conference  was  highly  successful  for  a  ntunber  of  reasons , 
First  of  all,  the  number  of  attendees  was  small  enough  so  that 
people  got  to  know  each  other  euid  had  ample  opportunity  for  both 
formal  cuid  informal  discussions.  This  was  further  enheuiced  by  the 
fact  that  several  breaks  were  scheduled  during  the  day,  euid  that 
meals,  residence,  and  the  meeting  rooms  were  all  located  in  the 
same  building.  There  were  people  from  a  variety  of  disciplines, 
and  m2uiy  different  views  of  the  stibject  emerged.  The  discussions 
were  spirited  but  constructive,  euad  many  people  have  since 
written  to  say  how  successful  they  thought  the  conference  was. 
Indeed,  there  are  informal  plans  for  holding  a  third  such 
conference  in  France  in  1996  under  the  sponsorship  of  INRIA,  and 
a  fourth  conference  in  Russia  in  1998  to  be  sponsored  by  the 
research  group  headed  by  Fuad  Alekserov. 

I  am  enclosing  copies  of  the  list  of  participeuits ,  the  program 
for  the  meeting,  cUid  the  aibstracts.  It  should  be  noted  that 
thanks  to  the  funding  provided  by  the  Grant,  we  were  eUale  to 
attract  some  of  the  top  names  in  the  World  who  work  in  this  area. 
Especially  worthy  of  mention  are; 

Edwin  Diday  of  France 

J.-P.  Doignon  of  Belgium 

Pierre  Heuisen  of  Ceinada  94*  1 5420 

Fred  Roberts  of  Rutgers 

Rudolf  Wille  of  Gemumy 

From  the  point  of  view  of  the  research  being  supported  by  ONR 
Grauit  N00014-90J-1008,  it  is  worthy  of  note  that  the  conference 
gave  a  meurvelous  opportunity  to  showcase  the  work  being  done 
under  this  Gramt,  and  to  attract  the  interest  of  both  research 
mathematic ieins  and  journal  editors  in  what  is  being  done. 

It  should  also  be  mentioned  that  there  was  a  short  course 
preceding  the  conference  devoted  to  software  euid  techniques 
developed  in  Darmstadt  for  processing  ordinal  data.  This  course 
attracted  a  number  of  people  from  Industry,  as  well  as  a  sizable 
contingent  from  our  Department  of  Sociology.  A  total  of  19  people 
participated  in  the  short  course. 
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on 

Ordinal  Data  Analysis 

Department  of  Mathematics  and  Statistics 
University  of  Massachusetts 
Amherst,  MA  01003,  USA 


Organizing  Commitee:  Local  Contact: 

Melvin  F.  Janowitz  (Amherst,  Massachusetts)  Professor  Melvin  F.  Janowitz 

Ivan  Rival  (Toronto,  Ontario)  Department  of  Mathematics  and  Statistics 

Fred  Roberts  (New  Brunswick,  New  Jersey)  University  of  Massachusetts 

Selma  Strahringer  (Darmstadt,  Germany)  Amherst,  MA  01003,  USA 

Rudolf  Wille  (Darmstadt,  Germany) 

^  Phone:  (413)-546-2871 

E-mail:  ordinalQmath.umaas.edu 

General  Information: 

The  conference  will  take  place  on  the  9th  floor  of  the  Campus  Center.  Plenary  lectures  and 
Session  I  will  be  in  Room  917,  Session  II  in  Room  911.  Coffee  and  refreshments  are  served 
during  the  breaks  in  Room  917. 

Registration: 

Thursday:  8.30  -  9.30  a.m.  in  front  of  Room  162  of  the  Campus  Center 
FViday:  8.00  -  9.00  a.m.  in  the  9th  floor  lobby  of  the  Campus  Center 


Preliminary  Events: 


Thursday,  October  14 


Short  Course  on  Formal  Concept  Analysis  held  by  Rudolf  WUle  (Darmstadt,  Germany)  j 

in  Sx>om  162  of  the  Campus  Center 

in  the  PC-Lab,  Room  01  (Basement)  of  the  School  of  Management  (SOM) 

WSLCOUING  RICBPTION  IH  RooU  162  O?  THB  CaIIFUS  CBKTBB. 

Saturday,  October  16 


■ 

Chair:  Mary  K.  Bennett 

1 

■ 

David  J.  Foulis  (Amherst,  Massachusetts) 

Test  Spaces  and  Data  Acquisition 

i 

1 

1 

9.45 

Uta  WUle  (Giessen,  Germany) 

Geometric  Representation  of  Ordinal  Contexts 

! 

i 

: : i 

10.30 

CoPFBE  Break  ! 

! 

Chair:  Fred  5.  Roberts 

1 

11.30 

1 

Jeau-Paul  Doignon  (Brussels,  Belgium) 

Unimodality  and  Straightness  of  Multipreference  Data 

j 

12.15 

1 

1 

1 

Don  Dearholt  (Starkville,  Mississippi)  | 

An  Introduction  to  Pathfinder  Ifetworks,  Proximity  Graphs  and  Applications  | 

1.00 

Lunch  for  the  Participants  in  Room  1001  1 

j 

Section  I 

section  n  1 

Chair:  Don  Dearholt 

Chair:  Bernhard  Monjardet  1 

2.30 

Ken  Papp  (Las  Cruces,  New  Mexico) 

Predicting  Human  Memory  and  Judgement 
Time  from  Ordinal  Measures  of  Psychological 
Proximity 

Audrey  V.  MaUshevaki  (Moscow,  Russia)  '! 
Generalisations  of  Transitivity  for  Relations  Be-  | 
tween  Sets  ;j 

1^^ 

John  P.  Sutcliffe  (Sydney,  Australia) 

Weight  of  Evidence  for  Strict  Simple  Ordering 
in  Pair  Comparison  Data  Subject  to  Error 

Norman  Cliff  (Los  Angeles,  California)  ! 

Ordinal  Alternatives  to  Mean  Comparisons 

1 

3.45  1 

Coffee  Break  j 

Section  I 

Section  11 

Chair:  BerthoU  Sehweiaer 

Chair:  Jean-PauL  Doignon 

4.15 

Norbert  Spaagenberg  (fVankfhrt,  Germany) 
Implicit  Knoededge  of  Psychoanalysis  Concern¬ 
ing  the  Suitability  of  Patients  for  Psychotherapy 

James  McIntosh  (Montreal,  Quebec)  j 

Multivariate  Analysis  of  Ordered  Categorical ! 
Data  1 

4.45 

Ingo  Leonhardt,  Michaela  Winterberg 
(Darmstadt,  Germany) 

CONCLASS  -  CONceptnal  CLustering  Assis¬ 
tant 

Robert  Jamison  (Clemson,  South  Carolina)  j 
Embedding  Graphs  in  Disjunctive  Products  of  ; 
Lines 

5.IS 

Tammo  Rock  (Daxmitadt,  Germany) 
TOSCANA’l  ~  A  Management  System  for  Con¬ 
ceptual  Data 

Piero  PagHani  (Rome,  Italy) 

A  Logic- Algebraic  Approach  to  Formal  Contexts  ! 
with  Partial  Information  | 
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Solving  the  Complete  Model  of  the 
Law  of  Comparative  Judgment 


Joseph  L.  Balloun 


Dept,  of  Management  and  Marketing 
Louisiana  Tech  University 
P.O.  Box  10318 
Reston,  LA  71272,  USA 


This  paper  presents  a  numerical  procedure  for  estimating  the  latent  parameters  of  the  Thurstonian 
law  of  comparative  judgment  for  data  obtained  by  a  common  experimental  procedure.  For  commonly 
encoxmtered  sample  sizes,  jackknife  analyses  allow  estimates  of  the  standard  errors  and  confidence 
limits  for  the  estimated  parameters,  and  the  overall  goodness-of-fit  function  may  be  tested. 

Past  approaches  to  solving  the  Thurstonian  law  of  comparative  judgments  have  encountered  two  sets 
of  challenges.  First,  the  complete  model  involves  more  unknowns  to  be  estimated  than  the  number  of 
observed  equations  available  from  experimental  data.  Second,  the  experimental  judgment  procedures 
commonly  used  yield  restricted  information  about  the  objects  (‘‘stimuli”)  to  be  measured,  xmder  con¬ 
ditions  not  readily  generalizable  to  the  situation  where  the  stimxili  will  be  used  for  measurement. 

There  are  at  least  three  strategies  for  reducing  the  two  major  barriers  to  solving  the  complete 
Thurstonian  law  of  comparative  judgments.  First,  past  efforts  have  restricted  the  effective  nxunber 
of  parameters  to  be  estimated  by  making  restrictive  assumptions  about  their  values.  Second,  it  is 
possible  to  wrest  more  observed  equations  &om  experimental  data.  Finally,  a  different  experimental 
design  may  yield  more  information  (direct  estimates  or  more  equations)  about  the  latent  parameters. 

In  the  social  sciences,  a  common  experimental  procedure,  and  the  one  usximed  in  this  paper,  is  to 
have  people  respond  to  several  different  stimuli  which  are  intended  to  measure  one  construct  or  con¬ 
cept.  Examples  include  attitude  sxirveys  or  academic  tests.  Such  a  design  allows  direct  estimation 
of  the  correlations  among  people  or  among  stimxili.  Moreover,  paired  comparisons  between  people 
and  between  stimuli  can  be  computed  from  the  observed  data  matrix.  The  marginal  means  of  people 
and  stimuli  proxride  additional  observation  equations.  The  various  first  derivatives  of  the  observed 
equations  relative  to  the  remaining  unknoxvn  parameters  can  be  foxmd,  and  a  conjugate  gradient 
method  can  be  used  to  estimate  the  latent  locations  and  "discriminal  dispersions”  of  the  stimuli  and 
the  people  in  a  common  space. 

The  proposed  method  is  demonstrated  with  an  artificial  data  set,  and  with  data  &om  an  academic 
intelligence  test.  Some  advantages  and  challenges  to  the  experimental  method  and  to  the  numerical 
analysis  method  are  discussed. 
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Representation  of  Binary  Data  by  Median  Semilattices 


Jean-Pierre  Barthelemy 

Laboratoire  d’lutelligence  Artificielle  et  Systems  Cognitifs 
Ecole  Nationale  Superieure  des  Telecommunication  a  de  Bretagne 
BP  832-29285  BREST  Cedex,  France 


A  Guttman  scale  allows  for  ordering  objects  (or  items  or  stimuli)  described  by  “presence-absence” 
attributes  (only  one  value:  “presence”  is  taken  into  account).  We  study  some  generalizations  of 
Guttman  scales  in  the  case  of  binary  data  (the  two  modalities  have  to  be  considered).  We  emphasize 
the  use  of  median  semilattices  to  represent  such  data. 
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Multivariate  Analysis  of 
Ordered  Categorical  Data 

Jon  A.  Breslaw 

Department  of  Economics 
Concordia  University 
1455  De  Maisonneuve  Blvd  W. 
Montreal,  Que  H3G  1M8,  CANADA 


The  problem  considered  in  this  paper  is  how  to  obtain  both  consistent  and  efficient  estimates  of  the 
parameter  vector  in  the  stochastic  regression  model, 

y*  =  x*0  +  fi 

when  only  categorical  responses,  {y,  x},  are  available  for  fdl  of  the  variables.  When  z*  is  observable 
the  appropriate  technique  is  the  standard  ordered  probit  model.  When  only  x  is  available,  replac¬ 
ing  the  exogenous  variables  by  dummies,  as  was  done  in  McIntosh  et  al  (1989),  is  not  satisfactory. 
Furthermore,  the  “latent  variable  approach”  has  proved  to  be  intractable  when  the  matrix  of  poly- 
choric  correlation  coefficients  is  not  positive  definite.  Thus  there  is,  at  present,  no  technique  which 
adequately  deals  with  the  multivariate  nature  of  the  problems. 

Our  approach  generalizes  a  two-stage  procedure  proposed  by  Ross  and  Zimmermmn  (1987)  using 
the  conditional  moments  of  x*.  The  first  stage  involves  the  estimation  of  the  matrix  of  delineation 
points  and  the  variance-covariance  matrix  for  the  distribution  of  x*.  A  one-factor  analytic  multi¬ 
variate  normal  distribution  is  employed,  and  this  permits  the  easy  and  efficient  estimation  of  the 
distributional  parameters  even  when  the  number  of  explanatory  variables  is  greater  than  three.  Us¬ 
ing  conditional  means  generated  from  the  first  stage,  a  semi-parametric  method  is  used  in  the  second 
stage  to  estimate  the  0  parameters.  The  method  is  applied  to  a  sample  of  Canadian  firms  which 
replied  to  a  Conference  Board  of  Canada  survey  questionnaire  on  investment  intentions. 


Sacond  Intamational  Confaranc*  on  Ordinal  Data  Analyaia.  Univaraiiy  of  MaaaachuaatU,  Amharat,  MA  01003.  Octobar  15-17,  1993 


Ordinal  Alternatives  to  Mean  Comparisons 


Norman  Cliff 

Department  of  Psychology 
University  of  Southern  California 
Los  Angeles,  CA  90089-1061,  USA 


An  obvious  zdternative  to  comparing  the  location  of  two  distributions  by  comparing  their  means 
is  to  use  a  procedure  based  on  ranks.  A  widely  known  procedure  that  does  not  require  that  the 
distributions  be  the  same  except  for  location  is  the  delta  measure,  using  the  Zaremba-Fligner-Policello 
estimate  of  the  variance  of  the  estimate  of  delta,  but  it  is  restricted  to  independent  groups  data. 
However,  the  variance  of  the  delta  estimate  in  the  case  of  paired  observations  can  be  defined  and 
used  as  the  basis  for  ordinal  inferences  with  a  repeated  measure,  alternative  to  the  paired-observations 
t-test.  It  has  certain  conceptual  and  statistical  advantages  over  statistics  such  as  the  sign  test  and 
the  Wilcoxon  signed- ranks  test.  Some  examples  are  given,and  a  graphical  aid  to  interpretation  of 
these  statistics,  the  "dominance  diagram”,  is  suggested  for  both  independent  groups  and  paired 
observations  cases.  Ways  in  which  these  location-comparison  procedures  can  be  generalized  to  more 
complex  designs  are  suggested. 

In  a  separate  study,  the  size,  power,  and  coverage  of  the  Zaremba-Fligner-Policello  method  for  making 
inferences  about  the  independent  groups  delta  measure  is  being  investigated.  Some  problems  with 
coverage  in  special  cases  have  arisen  due  to  the  correlation  between  the  estimate  d  and  its  estimated 
variance:  when  delta  happens  to  be  overestimated,  its  variance  is  imderestimated,  and  a  symmetric 
confidence  interval  tends  to  be  too  short  on  the  side  nearer  zero.  A  modification  that  improved 
coverage  was  found.  It  is  suggested  these  problems  are  similar  to  those  that  occ\ir  with  other 
botmded  measures  and  that  they  may  occur  rather  widely  with  ordinal  statistics  when  confidence 
intervals  are  used. 
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On  Level  Maps 
Frank  Critchley 

School  of  Mathematics  and  Statistics 
University  of  Birmingham 
Birmingham,  UNITED  KINGDOM 


A  series  of  classical  papers  in  the  1960’s,  notably  Benzecri  (1965),  Hartigan  (1967),  Jardine,  Jar- 
dine  and  Sibson  (1967)  and  Johnson  (1967),  established  that  the  ultramietrics  of  a  finite  set  S  can 
be  put  in  one-to-one  correspondence  with  either  the  dendograms  on  S  or  the  indexed  hierarchical 
classifications  on  5.  Several  extensions  of  these  bijections  have  been  given  in  the  literature.  In  par¬ 
ticular,  Jardine  and  Sibson  (1971)  generalise  from  the  ultrametrics  to  any  (symmetric)  dissimilarity; 
Janowitz  (1978)  and  Barthelemy,  Leclerc  and  Monjardet  (1984)  further  allow  the  dissimilarity  to 
take  values  not  only  in  the  nonnegative  reals  but  in  any  join  semi-lattice  with  a  smallest  element; 
and,  most  recently,  Critchley  and  Van  Cutsem  (1989)  deal  with  nonnegative- valued  dissimilarities 
which  are  not  necessarily  symmetric  (called  predissimilarities)  and  which  are  defined  on  an  arbitrary 
set  5. 

We  review  this  field  and  show  how  one  can  in  fact  produce  a  single  general  theorem  of  bijection 
between  the  predissimilarities  on  an  arbitrary  set  5  taking  values  in  an  arbitrary  ordered  set  L  and 
certain  increasing  families  of  subsets  ofSxS  indexed  by  L.  Indeed  this  theorem  is  in  turn  the  special 
c2Lse  E  =  5  X  5  of  a  natural  dual  order-isomorphism,  for  an  arbitrary  set  E,  between  all  functions 
/  :  E  — >  L  and  ail  so-called  “level”  maps  m  :  L  — *  V{E).  Note  that  there  are  no  constraints  on 
the  ordered  set  L.  In  particular,  L  need  to  be  totally  ordered. 

The  theory  and  applications  of  level  maps  are  briefly  indicated.  Several  alternative  characterisations 
of  level  maps  ctre  presented.  Some  of  these  involve  conditions  on  ordered  sets  that  appear  to  be 
original.  Their  relationship  with  residual  maps  is  explored.  Applications  include  certain  natural 
generalisations  of  standard  methods  of  dissimilarity  analysis.  In  particular,  to  asymmetric  and/or 
multiattribute  data. 


SKond  Intamatienal  Confartnca  on  Ordinal  Data  Analyait,  Univaraity  of  MaaaachuaatU,  Amharat,  MA  01003.  Octobar  lS-17.  1993 


Morphisms  of  Stability  Families 
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Many  authors  have  employed  the  idea  of  a  stability  family  to  study  various  kinds  of  consensus 
methods.  Let  X  be  a  set  on  which  a  consensus  method  is  to  be  applied.  A  stability  family  on  X  is 
a  pair  (5,7)  where  7  :  X  — ►  ^(*5)  ^  injective  function  from  X  to  the  set  of  subsets  of  5.  One 

can  think  of  the  set  7(x)  as  a  set  of  building  blocks  for  the  element  x  E  X.  Let  x,y  E  X.  Define 
I  <  y  if  and  only  if  7(1)  <  7(y).  If  V  is  a  finite  set  of  voters,  then  a  consensus  function  on  X  is 
a  function  F  :  X^  =>  X.  We  investigate  how  the  possible  consensus  methods  vary  with  different 
stability  families  which  induce  the  same  order  on  X.  To  facilitate  this  investigation,  various  types 
of  morphisms  of  stability  families  are  introduced  and  properties  of  consensus  functions  which  are 
preserved  by  these  morphisms  are  studied. 
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An  Introduction  to  Pathfinder  Networks, 
Proximity  Graphs  and  Applications 

Don  Dearholt 

Dept,  of  Computer  Science 
Mississippi  State  University 
Mississippi  State,  MS,  USA 


The  Pathfinder  network  model  for  human  semantic  memory  is  presented,  and  the  motivations,  ap¬ 
proaches,  and  research  objectives  of  the  paradigm  are  discussed.  Parameters  controlling  the  density 
of  edges  in  Pathfinder  networks  are  described,  and  the  control  of  violations  of  the  metric  axioms  are 
discussed.  Theoretical  results  include  the  construction  of  two  orthogonal  families  of  networks  associ¬ 
ated  with  a  given  dissimilarity  matrix,  the  capability  of  generating  single-link  hierarchical  clustering 
from  any  Pathfinder  network  (the  converse  is  not  true),  the  preservation  of  edge  structure  under 
multiplicative  transformations  of  the  dissimilarity  measures,  and  the  preservation  of  edge  structure 
under  monotonic  transformations  (preserving  order)  when  the  r-metric  is  infinity. 

Proximity  graphs,  which  require  a  coordinate  system,  are  briefly  introduced,  and  motivations  and 
applications  are  mentioned.  Relationships  between  Pathfinder  networks  and  some  proximity  graphs 
are  given,  and  a  new  family  of  proximity  graphs  derived  from  Pathfinder  networks  is  discussed. 

An  overview  of  some  applications  of  Pathfinder  networks  and  of  their  extensions  is  given.  These 
include  a  hypertext  browser  for  an  online  UNIX  help  system,  a  robotic  vision  database  system, 
and  modeling  dynamic  systems  xising  the  co-occurrence  of  entities.  The  latter  results  in  three  lev¬ 
els  of  representation:  imconstrained  by  either  partitional  or  hierarchical  constraints,  constrained 
by  the  generalized  triangle  inequality  (the  co-occtirrence  Pathfinder  network),  and  the  hierarchical 
constraints  resulting  in  single-link  clustering.  Some  situations  in  which  the  middle  level  of  repre¬ 
sentation  (the  co-occurrence  Pathfinder  network)  is  preferable  are  discussed,  and  aspects  of  learning 
and  consensus  are  presented  in  this  context.  Cumulative  consensus  is  defined  and  illustrated  using 
co-occurrence  Pathfinder  networks  and  the  scenarios  which  can  be  used  to  generate  them. 
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An  Introduction  to  Symbolic  Data  Analysis 

Edwin  Diday 
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The  main  aim  of  the  symbolic  approach  in  Data  Analysis  is  to  extend  problems,  methods  and 
algorithms  used  on  standard  data  to  more  complex  data  where  the  units  are  called  “symbolic  objects” , 
in  order  to  distinguish  them  from  objects  (described  by  numerical  or  categorical  variables)  treated 
by  standard  Data  Analysis  methods.  “Symbolic  objects”  extend  classical  objects  of  data  analysis  at 
least  in  two  ways:  first,  in  the  case  of  individuals,  by  giving  the  possibility  of  introducing  structured 
information  in  their  definition;  second,  in  the  case  of  sets  or  classes,  by  being  “intensionally”  defined. 
In  both  cases,  in  order  to  represent  imcertain  knowledge,  it  may  be  useful  to  use  probabilities, 
possibilities  (in  case  of  vagueness  and  imprecision  for  instance),  belief  (for  probabilities  only  known 
on  parts  and  to  express  ignorance);  that  is  why  we  introduce  several  kinds  of  sjrmbolic  objects: 
boolean,  possibilist,  probalilist,  and  belief.  We  briefly  present  some  of  their  qualities  and  properties; 
three  theorems,  show  how  Probability,  Possibility  and  Evidence  theories  may  be  extended  to  these 
objects.  Some  mixture  decomposition  problems  on  these  objects  are  settled.  We  show  that,  in  some 
cases,  fractals  are  well  adapted  to  representing  duality  between  s3rmbolic  objects.  Sets  of  symbolic 
objects  are  represented  by  categories  of  different  kinds  (hierarchies,  pyramids,  and  lattices).  Four 
kinds  of  data  analysis  problems  including  the  symbolic  extension  are  illustrated  by  several  algorithms 
which  induce  knowledge  firom  classical  data  or  from  a  set  of  symbolic  objects.  Finally,  important  steps 
of  a  symbolic  data  analysis  are  described  and  illustrated  by  an  example  concerning  road  accidents. 
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Testing  Unimodality  and  Straightness 
of  Multipreference  Data 

Jean-Paul  Doignon 

Department  of  Mathematics 
Universite  Libre  de  Bruxelles 
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Suppose  that  the  objects  forming  a  finite  set  X  are  ranked  according  to  personal  preference  by  the 
subject  from  a  finite  population  A.  Without  loss  of  generality,  it  is  assumed  that  the  (simple)  orders 
<a  thus  produced  by  the  subjects  a  from  A  are  all  distinct.  Then  M=  {X,  {<a|  a  G  A})  constitutes 
a  midUpreference  system. 

The  unfolding  model  of  Coombs  [3]  (see  also  Falmagne  and  Doignon  [5])  offei»  an  explanation  of 
such  data  on  the  basis  of  relative  proxinoity  in  some  embedding  ‘psychological’  space.  Formally,  any 
multipreference  system  M  admits  an  Euclidean  r-representation  in  the  following  sense:  there  exist 
two  mappings  f  on  A  and  g  on  X,  both  to  some  r-dimensional  Euclidean  space  S',  and  such  that 
for  a  €  A  and  i,j  €  X: 

*  <•  i  ^  l/(a)  -  ^(»)l  <  l/(«)  - 

Several  computer  programs  are  available  for  tentative  construction  of  Euclidean  r-representations 
with  lower  values  of  r  (see  e.g.  Borg  and  Lingoes  [2]).  However,  given  any  r  in  {1,2, . . no  neces¬ 
sary  and  sufficient  conditions  seems  to  be  known  for  the  existence  of  an  Euclidean  r-representation. 

A  recent  paper  by  Doignon  and  Falmagne  [4]  describes  a  polynomial-time  algorithm  that  tests 
whether  a  multipreference  system  admits  a  1-representation.  The  basic  idea  is  to  first  check  that 
two  necessary  conditions,  called  ‘unimodality’  and  ‘straightness’,  axe  fuUfiUed  by  the  system.  When 
they  are,  the  problem  is  then  reduced  to  an  existence  problem  for  a  finite  set  of  linear  inequalities. 

The  talk  will  focus  on  these  two  conditions  and  the  related  sub-algorithms.  Unimodality  plays  also 
a  role  in  voting  theory.  The  algorithm  we  present  to  test  unimodality  is  better  than  a  previous, 
indirect  one  by  Bartholdi  and  Trick  [1],  at  least  for  worst-case  execution  time,  and  also  for  insight 
in  the  concept  itself.  The  notion  of  straightness  will  be  introduced  in  a  nice,  geometric  setting,  by 
looking  at  the  so-called  permutohedron.  The  resulting  algorithm  is  then  compared  with  work  done 
by  Romero  [6]. 
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Test-Spaces  and  Data  Acquisition 
David  Foulis 
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Test  spaces  generalize  the  standard  notion  of  a  sample  space,  but  provide  for  situations  in  v^hich  there 
are  several  tests  or  measurements,  some  of  which  may  be  incompatible.  Two  tests  are  incompatible  if 
they  cannot  be  performed  simultaneously  without  interfering  with  each  other.  The  formal  deflntion 
of  a  test  space  is  deliberately  made  as  general  as  possible  to  accomodate  a  wide  range  of  applications. 
Data  acquired  from  executing  the  tests  are  provided  with  structure  inherited  from  the  patterns  formed 
by  the  test-patterns  which  become  significant  for  problems  of  statistical  inference. 
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Cluster  Analysis:  A  Point  of  View  for  Applications 
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Cluster  Analysis  is  a  term  used  to  describe  loosely  connected  sets  cf  concepts  criteria  and  algortithms 
for  discovering  and  defining  subgroups  in  observational  data. 

There  are  different  ways  of  looking  at  methods  of  Cluster  Analysis.  One  is  to  understand  the 
individual  methods.  Another  is  to  understand  the  process  of  using  the  methods  in  problem  solving. 
The  purpose  of  this  talk  is  to  focus  on  the  use  of  cluster  analysis  in  concert  with  related  methods  of 
data  analysis  in  problem  solving. 

At  present  there  is  limited  theory  and  formalism  for  directing  the  process  of  using  clustering  and 
related  methods  of  data  analysis  in  practice.  There  is,  however,  a  reasonable  consensus  that  any 
rational  approach  to  applications  should  consider:  purpose,  choice  of  objects,  type  of  data  structure, 
nature  of  the  problem,  selection,  transformation  and  scaling  of  variables,  choice  of  algorithm,  criteria 
of  evaluation,  and  appropriate  communication  of  applicability  of  the  proposed  classifications  and 
models. 

Suggestions  for  using  the  context  of  problems  to  guide  the  user  in  making  choices  within  this  frame 
work  will  be  provided.  Recommendations  for  software  and  computing  environments  will  be  given. 
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Sequential  Single  Cluster  Clustering  with  Ordinal  Data 
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Sequential  Single  Cluster  Clustering  proceeds,  given  a  set  of  entities,  by  finding  a  subset  (or  cluster) 
of  them  which  is  homogeneous  and/or  well  seperated,  removing  the  entities  of  that  cluster  and 
iterating  until  no  further  cluster  can  be  found.  In  the  case  of  ordinal  data  (i.e.,  when  only  a  ranking 
of  the  dissimilarities  between  pairs  of  entities  is  given)  a  criterion  of  homogeneity  is  the  split  of  the 
cluster,  or  minimum  dissimilarity  between  an  entity  inside  this  cluster  and  one  outside  of  it;  criteria 
for  homogeneity  are  (i)  the  radius  of  the  cluster,  or  maximum  dissimilarity  between  an  entity  of 
the  cluster  and  its  center  (also  an  entity,  chosen  to  minimize  the  value  of  the  radius)  and  (ii)  the 
diameter  of  the  cluster,  or  largest  dissimilarity  between  a  pair  of  entities  of  this  cluster.  Algorithms 
are  proposed  (or,  in  one  case,  reviewed)  to  maximize  split,  minimize  radius  or  diameter  for  all  possible 
cardinalities  of  the  cluster,  as  well  as  for  the  bicriterion  problems  with  split  and  radius  and  with  split 
and  diameter  criteria.  Detection  of  clusters  firom  lists  of  candidate  ones  is  discussed.  Computational 
experience  with  test  problems  from  the  literature  is  reported. 
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Embedding  Graphs  in  Disjunctive  Products  of  Lines 

Robert  Jamison 
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The  disjunctive  product  of  lines,  introduced  by  Strahringer  and  Wille,  provides  a  notion  of  ’’be¬ 
tweenness”  for  ordinal  data  in  several  variables.  This  notion  of  betweenness  leads  to  a  new  and 
rather  strange  notion  of  convexity  on  euclidean  space.  The  embedding  of  graphs  into  this  structure 
represents  an  attempt  to  capture  the  1-skeleta  of  polytopes  in  this  convexity.  I  will  discuss  several 
Steinitz  type  theorems  about  the  kinds  of  graphs  which  can  occur  as  polyhedral  1-skeleta. 
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Ordinal  Approaches  to  Cluster  Analysis 
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The  goal  of  cluster  analysis  is  to  suggest  the  possible  internal  structure  of  a  data  set  by  considering 
various  attributes  of  the  individual  objects  under  consideration.  The  usual  paradigm  converts  the 
attribute  data  into  a  measure  of  dissimilarity  (DC),  and  then  converts  this  intermediate  DC  into 
either  a  partition  or  a  nested  sequence  of  partitions  of  the  objects.  Of  course  if  the  original  attributes 
have  only  ordinal  significance,  it  is  hard  to  image  that  any  resulting  DC  will  have  more  than  ordinal 
significance.  But  even  if  the  attributes  are  of  scalar  or  interval  type,  the  intermediate  DC  often  still 
has  only  ordinal  significance.  Examples  of  this  include  the  use  of  product-moment  correlation  or  the 
cosine  of  the  angle  between  the  attribute  vectors  as  measures  of  dissimilarity. 

Ordinal  clustering  algorithms  may  be  naturally  classified  according  to  how  they  interact  with  order 
preserving  mappings  of  the  noimegative  reals.  This  leads  to  a  formal  context  whose  structure  is 
being  investigated  in  joint  work  with  Rudolf  Wille.  It  can  also  be  argued  that  the  appropriate 
cluster  algorithms  for  use  with  ordinal  data  (the  so-called  monotone-equi variant  cluster  methods) 
are  graph-theoretic  in  nature.  In  joint  work  with  Ralph  Stinebrickner,  it  has  been  shown  that  a  useful 
and  interesting  classification  of  such  clustering  algorithms  may  be  achieved  from  pure  graph-theoretic 
considerations,  thus  avoiding  any  into’action  with  any  mappings  on  the  reals.  A  brief  introduction 
will  be  provided  to  both  of  these  approaches. 
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Methods  in  Probabistic  Numerical  Identification  of  Bacteria 
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Probabilistic  numerical  identification  of  bacterial  strains  can  be  seen  as  joint  estimation  of  and 
classification  by  a  finite  mixture  of  multivariate  Bernoulli  diotributions,  [1,  3].  In  awidition  the  number 
of  classes  occurring  in  the  mixture  has  to  be  determined  from  data.  The  identification  of  a  strain 
X  =  with  binary  features  Xi,  means  assigning  x  to  the  class  Cj  that  maximizes  the  a  posteriori 

probability  P{cj  \  x)  estimated  prior  to  or  simultaneously  with  identification  from  data.  Due  to  a 
subsequently  increased  chance  of  error  the  identification  is  combined  with  a  rejection  option  known 
as  the  Willcox  probability,  see  [3].  We  analyse,  c.f.  [2],  the  performance  of  this  kind  of  techniques  by 
means  of  error-reject  curves  and  discuss  the  problem  of  identifiability. 
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Generalizations  of  Transitivity  for  Relations  between  Sets 

Andrey  V.  Malishevski 
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We  dead  with  binau'y  relations  given  on  the  power  set  of  some  basic  set  (finite  for  simplicity);  call 
them  Hyperrelations.  Apairt  from  the  usual  notion  of  the  transitivity  applied  to  such  kind  of 
relations,  we  consider  stronger  versions  of  similiar  properties,  calling  them  Hypertransitivity. 
These  versions  take  into  account  the  set-theorectical  structure  of  objects  connected  by  hyperrelations. 
Such  generadizations  are  useful,  e.g.,  in  models  of  logical  inference  or  dependencies  in  databases 
where  one  cam  infer  new  statements  tadcen  seperately.  Equivalent  representations  of  hsrpertransitive 
hyperrelations  are  suggested  which  use  auxiliary  fautnilies  of  representing  sets  or,  adtematively,  families 
of  orderings  on  the  bauic  set,  These  representations  actually  are  a  kind  of  generalization  of  the 
well'known  Szpilrajn>Dushnik-MiIler  representation  of  usuad  transitive  relations  (preorders  or  partial 
orders).  The  obtained  results  aure  applied  to  elucidate  mechanisms  of  generating  some  important 
types  of  set-to-set  mappings.  Two  opposite  cases  aure  investigated:  1)  expansions,  with  the  axioms 
of  abstract  closure  operators,  amd  2)  contractions,  with  the  so-cadled  Path  Independence  Axiom 
originated  in  decision  theory  au  applied  to  choice  functions.  Universad  representations  for  both  of 
these  cases  are  obtained.  The  relationships  with  the  E\iler-Venn  diagrams  amd  with  abstract  convex 
geometries  (amtimatroids)  are  established  as  well. 
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Mathematical  Classification  and  Clustering 


Boris  Mirkin 
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Mathematical  classification  theory  should  include  the  following  subjects:  classification  structures; 
models  connecting  classification  structures  with  data;  algorithms  to  fit  the  models;  interrelations 
between  those  models  and  algorithms;  interpretation  and  knowledge  generation;  interrelations  with 
other  mathematical  disciplines,  which  will  be  illustrated  with  an  example  based  on  an  approximation 
clustering  model.  The  paper  will  include  a  brief  discussion  of  each  of  the  above  topics. 
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Latticial  Theory  of  Consensus:  An  overview 
Bernard  Monjar det 
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The  consensus  problem  met  in  various  fields  (social  choice,  paired  comparisons  methods,  cluster 
analysis,  data  analysis  . . .)  is  to  “aggregate”  an  n'tuple  of  objects  into  a  —  or  several  —  consensus 
object(s).  Since  the  Arrow  axiomatic  approach  and  theorem  (1951-1957)  in  the  field  of  social  choice, 
many  “arrowian”  results  have  been  obtained  in  this  field  or  other  fields.  Three  years  ago  [5]  I  pre¬ 
sented  the  foundations  of  a  consensus  latticial  theory  able  to  account  of  many  of  the  results  obtaiined 
when  one  want  a  unique  consensus  object  (for  a  similar  approach  in  a  case  where  one  can  get  several 
consensus  objects  (see  [1]). 

In  this  talk  I  first  give  the  general  framework,  results  and  recent  developments  (especially  in  the  case 
of  valued  objects,  [3],  [4])  of  this  theory;  the  properties  of  a  dependence  relation  S  defined  on  the  set 
of  all  join-irreducible  of  any  (semi)iattice  plays  a  key  role  in  the  obtaining  of  several  of  these  results; 
then  I  show  that  this  relation  is  linked  with  the  weak  perspectivity  relations  (or  arrows  relations,  see 
[6])  defined  between  the  join  and  the  meet  irreducible  elements  of  the  lattice  and  I  give  a  report  of 
the  known  properties  of  the  dependence  relation  in  some  classes  of  lattices:  finally  I  consider  the 
generalization  of  this  theory  to  an  ordinal  context  and  the  connections  with  a  recent  work  in  this 
direction  [2]. 
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Predicting  Human  Memory  and  Judgment  Time  from 
Ordinal  Measures  of  Psychological  Proximity 


Kenneth  R.  Paap 

Deparment  of  Psychology  &  Computing  Research  Laboratory 
New  Mexico  State  University 
Las  Cruces,  NM  88003,  USA 


The  organization  of  knowledge  and  the  impact  of  this  organization  on  memory  and  judgment  has 
been  of  interest  to  cognitive  psychologists  for  a  long  time.  A  classical  way  of  representing  knowledge 
has  been  through  the  use  of  networks  where  concepts  are  represented  as  nodes  in  a  graph  struc  ire, 
and  the  relation  between  concepts  as  links  between  nodes.  The  Cognitive  Systems  Group  at  New 
Mexico  State  University  (Schvaneveldt,  1990)  has  developed  a  technique  called  "Pathfinder”  that 
empirically  derives  network  structures  from  proximity  data  such  as  relatedness  ratings.  Pathfinder 
connects  two  items  if  all  other  network  distances  between  those  two  items  are  greater  than  or  equal 
to  the  distance  estimate  for  that  pair.  Two  parameters,  r  and  9,  determine  how  network  distance  is 
calculated  and  affect  the  density  of  the  network.  The  parameter  r  defines  path  length  in  terms  of 
the  Minkowski  metric  and  when  r  =  cx),  path  length  equals  the  maximum  weight  of  the  links  that 
make  up  the  path.  Thus,  for  r  =  00  only  ordinal  assumptions  need  to  be  made  about  the  data. 

The  goal  of  this  talk  will  be  to  review  some  of  the  recent  evidence  that  supports  the  psychological 
validity  of  Pathfinder  networks  as  a  representation  of  human  semantic  memory.  For  example,  perfor¬ 
mance  on  a  cued  recall  task  is  influenced  by  whether  the  cue  and  target  are  linked  directly  or  through 
a  path  involving  a  second  node  (Bajo  &  Canas,  1992).  Performance  is  also  affected  by  the  number 
of  direct  links  emanating  from  a  cue  node.  These  effects  occur  even  when  the  cue-target  pairs  are 
equated  for  associative  strength  and  relatedness  ratings.  In  another  recent  study  Cooke  (1992)  has 
shown  that  Pathfinder  proximities  predict  response  time  in  category  and  size  judgments  tasks  better 
than  the  ratings  &om  which  the  networks  were  derived  and  better  than  proximities  obtained  from 
multidimensional  scaling.  These  successes  may  be  attributed  to  Pathfinder’s  emphasis  on  relatedness 
estimates  from  the  related  end  of  the  scale  that  tend  to  be  more  sensitive  to  differences  in  conceptual 
proximity  than  those  at  the  unrelated  end  (Roske-Hofstrand  &  Paap,  1990).  Pathfinder  discards 
these  least  sensitive  estimates  and  reconstructs  them  using  the  most  sensitive  estimates. 
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A  Logic- Algebraic  Approach  to 
Formal  Contexts  with  Partial  Information 

Piero  Pagliani 
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Burmeister  has  studied  a  logical  model  allowing  the  calculus  of  dependency  relations  among  attributes 
in  partial  binary  contexts,  that  is  in  contexts  K  =  (G,  A/,  /),  in  which  the  information  function  /  is 
just  partially  defined  over  the  direct  product  G  x  M. 

Since  these  contexts  are  only  partially  informed,  not  only  definite  questions  like  “does  A  depends  on 
S”  (A,  S  €  P(Af ))  make  sense,  but  also  “not  deterministic”  questions  like  “the  dependency  of  A 
and  B  is  True  or  Not  Definite,  but  not  False”  may  be  of  relevance. 

In  my  contribution,  I  provide  a  logic-algebraic  model  for  the  calctilus  of  attribute  dependencies  in 
partial  binary  contexts.  I  show  that  any  suitable  interpretation  of  the  notion  of  dependency  among 
sets  of  attributes  has  to  preserve  some  minimal  logical  requirements:  namely  the  compatibility  with 
Classical  Logic  and  a  stability  condition.  These  requirements  lead,  fitom  the  algebraic  point  of  view, 
to  Kleene  algebras  and  from  a  logical  point  of  view  to  the  Strong  Semantics  of  Kleene.  But  in  order 
to  be  able  to  obtain  non-deterministic  information  about  dependencies,  we  have  to  extend  Kleene 
algebras  to  semi-simple  Nelson  algebras  in  order  to  be  able  to  apply  the  “extended”  implication 
operations  -<  and  ID  that  I  introduce  in  order  to  supply  the  required  non  determistic  inform.  tion 
about  dependencies. 

Nevertheless,  via  a  new  intuitive  model,  that  1  call  the  “conveyed  information  model”,  it  is  shown 
that  in  a  sense  the  Kleene  logic  is  the  natural  logic  connected  to  partial  contexts. 

It  is  interesting  that  an  intuitive  analysis  of  partial  contexts  provides  in  this  way  a  “concrete” 
interpretation  of  the  dual  spaces  of  Kleene  algebru. 

More  precisely,  given  a  partial  context  K  =  {G,M,I),  I  obtain  a  structure  (G,  <,  /),  the  spectral 
space,  Spect{K),  of  the  partial  context  K*  =  (G*,  M*,  /*),  where  G  is  a  set  of  elements  which  represent 
the  general  information  behaviour  of  the  original  elements  from  G,  the  relation  <  represents  possible 
transmission  of  information  from  more  defined  elements  of  G  to  less  informed  elements  and  /  is  a 
track  linking  what  is  known  about  an  element  of  G  to  what  has  not  (yet)  been  defined. 

The  structure  (G,  <,  /)  reveals  to  be  the  dual  space  of  the  Kleene  algebra  produced  by  the  application 
of  an  operator  t  to  the  elements  of  M*  (that  represent  the  general  information  behaviour  of  the 
original  elements  from  M): 

ir({m})  =  e  G*  definitely  enjoying  m  €  Af*},  {g  6  G*  definitely  not  enjoying  m  €  M*}) 

In  support  of  these  results  there  is  the  fact  that  using  different  technical  and  intuitive  approaches  I 
have  obtzuned  essentially  the  same  conclusions  u  those  of  Burmeister  where  the  topic  was  originally 
studied. 
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Consensus  Methods  for  n-Weak  Hierarchies 

Robert  C.  Powers 


Department  of  Mathematics 
University  of  Louisville 
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Let  n  >  3  be  a  positive  integer.  Bandelt  et  al.  (1991)  defined  an  n-weak  hierarchy  H  on  a  finite 
set  S  to  be  a  collection  of  subsets  of  S  such  that  for  any  Ai, . . . ,  An  in  H  there  exists  an  index  j  such 


that 


n  Ai=  0  Ai. 

It  was  observed  by  Bandelt  (1992)  that  a  3- weak  hierarchy  is  a  generalization  of  a  prepyramid  (see 
Diday  (1986)). 

A  k-tuple  of  hierarchies  (/fi, . . . ,  Hk)  is  called  mixed  if  there  is  a  decomposition  {Aj  :  j  =  3, . . . ,  n} 
of  the  index  set  {1, . . . ,  A}  such  that  i  €  A,  implies  Hi  is  en  veak  hierarchy  for  t  >  j.  The  problem 
of  giving  a  well-defined  method  of  consensus  using  h-tuples  of  mixed  hierarchies  as  input  and  n-weak 
hierarchies  as  output  is  investigated. 
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Drawing,  Order  and  Surfaces 
Ivan  Rival 
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The  modem  theoretical  computer  science  literature  is  preoccupied  with  efficient  data  structures  to 
code  and  store  ordered  sets.  Among  these,  graphical  ones  play  a  decisive  role.  Loosely  speaking, 
graphical  data  structures  must  be  DRAWN  in  order  that  they  may  be  easily  READ. 

Besides  the  well  known  metaphors  inspired  by  layout  design,  project  management,  cuid  database 
design,  several  unexpected  application  areas  are  driving  our  recent  investigations.  These  include  ice 
flows,  fisheye  techniques,  molds,  and  quantum  topology. 

From  the  viewpoint  of  graph  drawing  there  is  a  common  thread  to  these  themes.  Each  involves 
an  ordered  set  and  each  views  the  ordered  set  monotonically  on  a  two-dimensional  surface.  We 
are  led  ineluctably  to  upward  drawings  on  an  orientable  surface  in  3-space.  Its  edges  are  drawn 
monotonically  with  respect  to  the  south-north  direction. 

The  tools  appropriate  to  this  analysis  come  from  topological  graph  theory  and  differential  topology. 
In  turn,  this  work  leads  to  new  algorithmic  considerations.  How  difficult  is  it  to  pentangulate  a 
covering  graph?  Does  every  ordered  set  have  a  cellular  upward  drawing?  How  to  approximate  the 
critical  points  of  a  surface  with  an  upward  drawing  of  the  same  genus? 
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The  Meaningfiilness  of  Ordinal  Comparisons 
for  General  Order  Relational  Systems 
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One  of  the  most  useful  concepts  of  measurement  theory  is  that  of  meaningfulness  or  invariance.  In 
generzd  terms,  a  statement  using  scales  of  measurement  is  called  meaningful  if  its  truth  or  falsity 
is  unchanged  if  all  scales  in  the  statement  are  replaced  by  other  acceptable  scales.  The  theory  of 
meaningfuless  has  been  applied  to  index  numbers,  averaging  and  merging  procedures,  combinatorial 
optimization,  psychophysical  scaling,  and  analysis  of  order  and  matching  experiments,  to  name  just 
a  few  of  its  applications.  The  simplest  statement  whose  meaningfulness  needs  to  be  understood  is  the 
ordinal  comparison  /(a)  >  /(6),  the  assertion  that  item  a’s  assigned  scale  value  is  larger  than  item 
b’s.  In  this  talk,  we  describe  the  systematic  attempt  to  understand  when  this  ordinal  comparison 
is  meaningful.  The  talk  will  concentrate  on  the  case  when  the  scale  arises  from  a  homomorphism  / 
into  a  general  order  relational  system,  a  union  of  m-ary  relations  induced  on  the  set  of  real  numbers 
by  rankings  of  the  integers  in  {1,2, . . .  ,m}.  General  order  relational  systenu  arise  from  the  concept 
of  m-point  homogeneity  which  plays  a  central  role  in  the  theory  of  scale  type  and  meaningful!  ess. 
The  talk  will  explain  the  connection,  and  then  will  describe  how  to  determine  if  a  homomorphism 
into  a  general  order  relational  system  leads  to  meaningful  ordinal  comparisons. 
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TOSCANA’l 

A  Management  System  for  Conceptual  Data 

Tammo  Rock 
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TOSCANA’l  is  a  computer  management  system  for  conceptual  data  processing.  It  allows  to  imple¬ 
ment  and  treat  conceptual  data  systems  built  on  large  data  contexts  and  data  bases.  Conceptual  data 
systems  (as  they  are  developed  in  the  research  group  on  formal  concept  analysis  at  the  TH  Darm¬ 
stadt)  visualize  conceptual  structures  inherent  in  the  data  by  nested  line  diagrams.  This  method 
of  knowledge  representation  can  be  effectively  used  for  focussing  and  browsing  in  large  conceptual 
structures.  TOSCANA’l  manages  these  procedures  by  so-called  conceptual  scales.  The  user  can 
choose  those  scales  according  to  his  interests.  By  examples,  it  will  be  demonstrated  how  to  use 
interactively  TOSCANA’l  for  different  purposes. 
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Simulation  Based  Inference  in  Econometrics 

Til  Schuermann 

AT&T  Bell  Laboratories 
600  Mountain  Avenue 
Murray  Hill,  NJ  07974,  USA 


New  techniques  in  simulation-based  inference  are  applied  to  joint  test  of  adaptive,  rational  and 
naive  expectations  using  categorical  business  survey  data.  We  assume  that  the  survey  responses  for 
expectations  emd  realizations  are  triggered  by  jointly  normally  distributed  continuous  latent  variables 
which  allows  for  the  construction  of  a  simple  likelihood  function.  However,  this  simple  likelihood 
contains  multi-(four)dimensional  integrals  and  requires  simulators  to  evaluate.  Simulated  maximum- 
likelihood  estimation  is  carried  out  using  the  smooth  recursive  conditioning  (SRC)  simulator  due  to 
Geweke,  Hajivassiliou,  Keane  and  McFadden,  a  method  which  is  consistent  and  has  low  variance. 
The  latter  is  crucial  when  maximizing  the  log-likelihood  directly  by  simulation.  Identification  of  the 
parameters  is  achieved  by  placing  restrictions  on  the  response  thresholds  and/or  the  variances.  We 
apply  this  method  to  micro-level  business  survey  data  from  Switzerland  and  the  United  Kingdom. 
We  find  that  we  can  reject  all  three  hypotheses. 
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Psychoanalytic  knowledge  is  based  on  highly  abstract  theoretical  assumptions.  Nevertheless  it  is 
a  practical  knowledge  which  helps  the  analyst  to  understand  his  patient,  to  steer  the  therapeutic 
process  by  his  interventions  and  to  cope  with  the  psychic  charges  of  the  emotionally  dense  interaction 
process.  I  want  to  show  how  the  knowledge  structure  of  experienced  psychoanalysts  was  explored. 
The  question  is  how  the  (inflexible)  abstract  knowledge  interferes  with  practical  knowledge  which 
must  be  highly  flexible  because  every  patient  is  different. 
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There  is  a  wide-spread  interest  and  utility  in  ‘‘putting  things  into  rank  order”.  For  that  purpose, 
the  method  of  pair  comparisons  is  to  be  preferred  to  the  intuitive  method  of  rank  order,  because  the 
latter  imposes  orderability  regardless,  sometimes  spuriously,  whereas  the  former  allows  separation  of 
non-orderable  from  orderable  domains  and  yields  rank  orders  only  in  the  latter  appropriate  cases. 

As  noted,  the  method  of  pair  comparison  allows  testing  of  the  hypothesis  of  the  orderability  of  a 
domain.  In  particular,  transitivity  is  necessary  for  simple  ordering.  One  might  choose  to  reject 
the  hypothesis  of  orderability  on  the  evidence  of  intrasitivity  provided  by  observed  circular  triads. 
On  the  other  hand,  if  the  given  domain  were  presumptively  simply  orderable,  to  save  the  main 
hypothesis  one  could  entertain  a  supplementary  hypothesis  that  the  circular  triads  had  arisen  from 
errors  of  observation.  If  one  could  identify  such  “errors”,  then  one  could  “correct”  them  to  remove 
any  intransitivities,  and  thence  establish  a  simple  order. 

This  paper  addresses  the  question  of  how  to  deal  with  circular  triads  via  a  theory  of  errors  of 
observation.  If  one  allows  “error”  then  in  general  any  one  of  the  pair  comparisons  made  may  be  in 
error,  one  can  obtain  any  one  of  n!  orderings  of  n  distinct  objects.  A  “corrected  solution”  in  such  a 
case  is  arbitrary.  More  restrictively,  however,  one  can  use  “waght  of  evidence”  to  rule  out  error  for 
some  pair  comparisons  and  to  indicate  otherwise  where  the  errors  may  lie.  An  algorithm,  justified 
by  several  theorems  of  the  theory,  serves  to  partition  the  space  of  data  possibilities.  There  are  five 
main  categories  of  data,  one  of  which  is  for  error  &ee  transitivity,  another  of  which  is  for  intractable 
intransitivities,  and  the  others  cover  various  degrees  of  amenability  to  unique  “correction”  to  full 
transitivity. 
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Data  analysis  always  uses  concepts  to  elaborate  interpretations  of  given  data.  Therefore,  a  formal 
treatment  of  (ordinal)  data  has  to  include  some  formal  understanding  of  concepts  and  conceptual 
relations.  Formal  Concept  Analysis  yields  such  understanding  by  mathematizing  the  philosophical 
view  of  a  concept  as  a  unit  of  thought  constituted  by  its  extension  and  its  intension  (comprehension). 
The  advantage  of  Formal  Concept  Analysis  is  that  it  unfolds  the  inherent  conceptual  structures  of  the 
data  without  data  reduction;  furthermore,  it  allows  an  informative  visualization  of  such  conceptual 
structures  by  line  diagrams. 

Ordinal  data  can  be  effectively  analyzed  by  methods  of  Formal  Concept  Anal3rsis;  this  will  be  demon¬ 
strated  by  examples.  Even  larger  data  sets  may  be  conceptually  represented  by  using  nested  line 
diagrams.  Recent  experiences  with  applications  in  different  areas  have  shown  the  need  of  Conceptual 
Data  Systems  which  provide  an  even  better  treatm«it  of  large  data  contexts  which,  in  particular,  al¬ 
low  focussing  and  browsing  in  conceptual  structures.  A  model  for  Conceptual  Data  Systems  has  been 
developed  during  the  last  three  years  including  a  computer  management  system  named  TOSCANA’l. 
First  promising  applications  show  that  implemented  Conceptual  Data  Systems  will  be  a  useful  tool 
for  Ordinal  Data  Analysis. 

References: 

P.  Scheich,  M.  Skorsky,  F.  Vogt,  C.  Wachter,  R.  Wille:  Conceptual  Data  Systems.  In:  0.  Opitz,  B. 
Lausen,  R.  Klar  (eds.):  Information  and  Classification.  Springer,  Berlin-Heidelberg  1993,  72-84 

F.  Vogt,  C.  Wachter,  R.  Wille:  Data  analysis  hosed  on  a  conceptual  file.  In:  H.  H.  Bock  and  P.  Dun 
(eds.):  Classification,  data  analysis,  and  knowledge  organization.  Springer,  Berlin-Heidelberg  1991, 
131-140 

R.  Wille:  Concept  lattices  and  conceptual  knowledge  systems.  Computers  &  Mathematics  with  Ap¬ 
plications  21  (1S^2),  493-515. 

R.  Wille:  Begrifflicke  Datensysteme  ab  Werkzeug  der  Wissenskommunikaiion.  In:  H.  H.  Zimmer- 
mann,  H.-D.  Luckhardt,  A.  Schulz  (eds.):  Mensch  und  Maschine  —  Informelle  Schnittstellen  der 
Kommunikation.  Universtatsverlag  Konstanz,  Konstanz  1992,  63-73. 


Saceml  IntwiwtioiMil  Cenftrtnca  on  Ordinal  Data  Analyiis.  Univaraity  of  MataachuMtU,  Amharat,  MA  01003,  Octobar  15-17.  1993 


Geometric  Representation  of  Ordinal  Contexts 

Uta  Wille 

PB  Mathematik 
Universitat  Giessen 
Arndtstr.  2 
Giessen,  GERMANY 


Ordinal  contexts  are  given  by  data  matrices  representing  attribute  values  for  certain  objects  where 
the  values  of  each  attribute  form  an  ordered  set.  A  geometric  representation  of  an  ordinal  context 
is  understood  as  the  combination  of  an  object  embedding  into  an  ordered  vector  space,  an  attribute 
embedding  into  its  dual  space,  and  an  attribute  value  embedding  into  the  ordered  field  of  scalars 
such  that  an  object  g  has  the  value  w  of  the  attribute  m  iff  (^m(x«)  =  for  the  corresponding  vector 
Xgt  the  linear  form  ^d  the  scalar  r^  (cf.  [2]).  In  1964,  D.  S.  Scott  gave  a  first  characterization 
of  such  geometric  representation  in  an  n-dimensional  vector  space  over  the  reals  for  finitely  many 
objects  and  n  +  1  attributes  (cf.  [1]).  Since  Scott’s  axiomatization  is  not  finite,  it  cannot  be  used  to 
decide  whether  a  geometric  representation  exists  for  a  given  data  context.  Nevertheless,  following 
Scott’s  approach,  a  representation  theorem  can  be  proved  which  guarantees  the  representability  of 
a  finite  ordinal  context  by  a  finite  number  of  axioms  if  the  values  of  each  attribute  are  distinct. 
An  example  will  clarify  the  content  of  the  representation  theorem  and  further  questions  concerning 
geometric  representation  of  ordinal  contexts. 
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A  method  to  determine  the  size  of  a  classification  tree  is  proposed.  This  method  is  based  on  the 
change  of  this  apparent  misclassification  rate  (AMR)  of  the  tree  at  each  growing  stage.  The  method 
is  simple  and  fast  compared  to  the  other  classification  tree  methods  which  are  based  on  minimizing 
a  cost  complexity  function  (Breiman  et  al.,  1984).  To  test  the  method,  it  was  used  to  classify  species 
of  fungi,  and  the  results  are  in  good  agreement  with  those  obtained  by  linear  discriminant  analysis. 
Also,  21  proteins  with  known  structures  and  functions  were  classified  using  the  proposed  method. 
For  this  purpose  the  coefficient  of  variation  (CV)  for  several  properties  of  the  secondary  structures 
of  these  proteins  has  been  used.  Again,  the  results  were  in  good  agreement  with  the  classification 
obtained  by  Orengo  et  cd.  (1992)  using  dynamic  programming. 


